Bootstrapping Lexical Knowledge from Unsegmented Text using Graph Kernels
نویسندگان
چکیده
منابع مشابه
Using Lexical Knowledge in Text Classification
This paper describes several experiments in text classification using WordNet, a rich source of lexical background knowledge available in the public domain. WordNet is used to map the original words from a text into sets based on synonym and hypernym relationships. This information is used to compute a change of representation from bag of words to hypernym density. Six binary classification tas...
متن کاملDiscovering Chinese Words from Unsegmented Text
In English written text, words are separated by spaces, but in written Chinese text, there are no such separators between words. (See Figure 1.) Thus, effective information retrieval of Chinese text first requires good word segmentation. In this paper, we investigate an efficient algorithm to discover the words and their occurrence probabilities from a corpus of unsegmented text without using a...
متن کاملText classification using Semantic Information and Graph Kernels
The most common approach to the text classification problem is to use a bag-of-words representation of documents to find the classification target function. Linguistic structures such as morphology, syntax and semantic are completely neglected in the learning process. This paper uses another document representation that, while including its context independent sentence meaning, is able to be us...
متن کاملUsing Graph-Kernels to Represent Semantic Information in Text Classification
Most text classification systems use bag-of-words representation of documents to find the classification target function. Linguistic structures such as morphology, syntax and semantic are completely neglected in the learning process. This paper proposes a new document representation that, while including its context independent sentence meaning, is able to be used by a structured kernel functio...
متن کاملAcquiring Lexical Knowledge from Text: A Case Study
Language acquisition addresses two important text processing issues. The immediate problem is understanding a text in spite of the existence of lexical gaps. The long term issue is that the understander must incorporate new words into its lexicon for future use. This paper describes an approach to constructing new lexical entries in a gradual process by analyzing a sequence of example texts. Th...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Transactions of the Japanese Society for Artificial Intelligence
سال: 2011
ISSN: 1346-0714,1346-8030
DOI: 10.1527/tjsai.26.440